Merging Applicability Domains for in Silico Assessment of Chemical Mutagenicity

نویسندگان

  • Ruifeng Liu
  • Anders Wallqvist
چکیده

Using a benchmark Ames mutagenicity data set, we evaluated the performance of molecular fingerprints as descriptors for developing quantitative structure-activity relationship (QSAR) models and defining applicability domains with two machine-learning methods: random forest (RF) and variable nearest neighbor (v-NN). The two methods focus on complementary aspects of chemical mutagenicity and use different characteristics of the molecular fingerprints to achieve high levels of prediction accuracies. Thus, while RF flags mutagenic compounds using the presence or absence of small molecular fragments akin to structural alerts, the v-NN method uses molecular structural similarity as measured by fingerprint-based Tanimoto distances between molecules. We showed that the extended connectivity fingerprints could intuitively be used to define and quantify an applicability domain for either method. The importance of using applicability domains in QSAR modeling cannot be understated; compounds that are outside the applicability domain do not have any close representative in the training set, and therefore, we cannot make reliable predictions. Using either approach, we developed highly robust models that rival the performance of a state-of-the-art proprietary software package. Importantly, based on the complementary approach used by the methods, we showed that by combining the model predictions we raised the applicability domain from roughly 80% to 90%. These results indicated that the proposed QSAR protocol constituted a highly robust chemical mutagenicity prediction model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

In-silico prediction of Cellular Responses to Polymeric Biomaterials from Their Molecular Descriptors

In this work quantitative structure activity relationship (QSAR) methodology was applied for modeling and prediction of cellular response to polymers that have been designed for tissue engineering. After calculation and screening of molecular descriptors, linear and nonlinear models were developed by using multiple linear regressions (MLR) and artificial neural network (ANN) methods. The root m...

متن کامل

Application of a chemical reactivity database to predict toxicity for reactive mechanisms

Covalent binding of xenobiotic electrophiles to nucleophilic endogenous biomolecules, e.g. peptides or DNA, is a common molecular initiating event, leading to potentially irreversible toxic effects such as enhanced acute toxicity, skin sensitisation, or mutagenicity. This knowledge provides the basis for the in silico prediction of these toxicities. The potential for a chemical to be reactive c...

متن کامل

Applicability Domains for Classification Problems: Benchmarking of Distance to Models for Ames Mutagenicity Set

The estimation of accuracy and applicability of QSAR and QSPR models for biological and physicochemical properties represents a critical problem. The developed parameter of "distance to model" (DM) is defined as a metric of similarity between the training and test set compounds that have been subjected to QSAR/QSPR modeling. In our previous work, we demonstrated the utility and optimal performa...

متن کامل

Application of in silico modelling to estimate toxicity of migrating substances from food packaging.

This study derived toxicity estimates for a set of 136 chemical migrants from food packaging materials using in silico (computational) modelling and read across approaches. Where available, the predicted results for mutagenicity and carcinogenicity were compared with published experimental data. As the packaging compounds are subject to safety assessment, the migrating substances were more like...

متن کامل

بررسی تاثیر عصاره های متانولی گیاه Avicennia marina بر رشد و تکثیر لنفوسیت و جهش زایی آن ها با استفاده از آزمون ایمز و روش های بیوانفورماتیکی

Background and purpose: Avicennia marina (family Acanthaceae) has been used as traditional medicine in Iran to treat some diseases such as ulcers, rheumatism and burns. The present study investigated the in silico and in vitro mutagenicity of the fruit, leaf, seed and stem extracts of Avicennia marina and their effects on human peripheral blood mononuclear cells (PBMC) proliferation. Materia...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and modeling

دوره 54 3  شماره 

صفحات  -

تاریخ انتشار 2014